Constructing Finite State Machines for Fast Gesture Recognition

Authors

  • Pengyu Hong
  • Thomas S. Huang
  • Matthew Turk
Abstract

This paper proposes an approach to 2D gesture recognition that models each gesture as a Finite State Machine (FSM) in spatial-temporal space. The model construction works in a semi-automatic way. The structure of the model is first manually decided based on the observation of the spatial topology of the data. The model is refined iteratively between two stages: data segmentation and model training. Given the continuous training data of a single gesture, we roughly segment the gesture trajectory into phrases using the spatial information alone. The segmentation results are used to initialize an FSM. The model is used to re-segment the data. The results of the re-segmentation are used to refine the parameters of the model. After the FSM is trained, we incorporate a modified Knuth-Morris-Pratt algorithm into the FSM recognition procedure to speed up the gesture recognition. The computational efficiency of the FSM recognizers allows real-time on-line performance to be achieved.
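The abstract does not say how the Knuth-Morris-Pratt algorithm is modified, so the following is only a sketch of the classic, unmodified algorithm applied to a gesture setting. It assumes, purely for illustration, that every input frame has already been quantized to an integer state label and that a gesture is a fixed sequence of such labels. The names `failure_table` and `find_gesture` are hypothetical.

```python
from typing import List, Sequence


def failure_table(pattern: Sequence[int]) -> List[int]:
    """fail[i] is the length of the longest proper prefix of
    pattern[:i + 1] that is also a suffix of it."""
    fail = [0] * len(pattern)
    k = 0  # length of the currently matched prefix
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k - 1]  # fall back to a shorter candidate prefix
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    return fail


def find_gesture(stream: Sequence[int], pattern: Sequence[int]) -> List[int]:
    """Return the start index of every occurrence of `pattern`
    (a gesture's state-label sequence) in `stream`.

    Classic KMP, not the paper's modified variant: each stream
    element is read once and the stream is never re-scanned.
    """
    if not pattern:
        return []
    fail = failure_table(pattern)
    hits: List[int] = []
    k = 0  # number of pattern labels matched so far
    for i, label in enumerate(stream):
        while k > 0 and label != pattern[k]:
            k = fail[k - 1]  # reuse the partial match instead of restarting
        if label == pattern[k]:
            k += 1
        if k == len(pattern):
            hits.append(i - k + 1)
            k = fail[k - 1]  # keep going so overlapping matches are found
    return hits
```

The point of the failure table is that a mismatch in the middle of a gesture does not throw away the frames already matched, which is what makes this style of matching attractive for on-line input.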

Similar resources

How Finite State Machines Can Be Used to Build Error Free Multimodal Interaction Systems

Recognition-based interaction technologies (e.g. speech and gesture recognition) are still error-prone. It has been shown that, in multimodal architectures, combining complementary input modes can contribute to automatic recovery from recognition errors. However, the degree to which error recovery can be achieved is dependent on the design of the interaction, i.e. on the set of multimodal const...

Gesture Modeling and Recognition Using Finite State Machines

This paper proposes a state based approach to gesture learning and recognition. Using spatial clustering and temporal alignment, each gesture is defined to be an ordered sequence of states in spatial-temporal space. The 2D image positions of the centers of the head and both hands of the user are used as features; these are located by a color based tracking method. From training data of a given ...
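The idea of a gesture as an ordered sequence of states over 2D positions can be sketched as a small state machine. This is only an illustration under stated assumptions, not the authors' model: it tracks a single 2D point rather than the head and both hands, each state is a hypothetical circle (`center`, `radius`), and `max_gap` is a hypothetical limit on how many consecutive frames may fall outside both the current and the next state before the machine resets.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class State:
    center: Point  # assumed spatial centre of the state, in image coordinates
    radius: float  # assumed acceptance distance around the centre


class GestureFSM:
    """Accepts a trajectory that visits the states in order."""

    def __init__(self, states: List[State], max_gap: int = 10) -> None:
        if not states:
            raise ValueError("a gesture needs at least one state")
        self.states = states
        self.max_gap = max_gap
        self.reset()

    def reset(self) -> None:
        self.current = -1  # index of the last state entered; -1 = not started
        self.gap = 0       # consecutive frames spent outside any expected state

    def _inside(self, index: int, point: Point) -> bool:
        s = self.states[index]
        return hypot(point[0] - s.center[0], point[1] - s.center[1]) <= s.radius

    def step(self, point: Point) -> bool:
        """Feed one tracked position; return True when the gesture completes."""
        nxt = self.current + 1
        if self._inside(nxt, point):
            self.current, self.gap = nxt, 0   # advance to the next state
        elif self.current >= 0 and self._inside(self.current, point):
            self.gap = 0                      # still dwelling in the current state
        elif self.current >= 0:
            self.gap += 1                     # in transit between states
            if self.gap > self.max_gap:
                self.reset()                  # took too long: abandon the attempt
        if self.current == len(self.states) - 1:
            self.reset()                      # final state reached: accept
            return True
        return False
```

A usage example: with states centred at (0, 0), (5, 0) and (5, 5), feeding the points of an L-shaped path one at a time makes `step` return `True` on the frame that first enters the last state, while the same states visited in reverse order are never accepted.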

Virtual Document Projector Camera

This paper describes techniques for the design of a system able to interact with the user by visual recognition of hand gestures. The system is composed of three modules including tracking, posture classification and gesture recognition. A description of each module is given. In order to increase the robustness and the precision of the tracking, several complementary tracking processes are coup...

A hand gesture recognition technique for human-computer interaction

We propose an approach to recognize trajectory-based dynamic hand gestures in real time for human–computer interaction (HCI). We also introduce a fast learning mechanism that does not require extensive training data to teach gestures to the system. We use a six-degrees-of-freedom position tracker to collect trajectory data and represent gestures as an ordered sequence of directional movements i...

A Toolkit for Creating and Testing Multimodal Interface Designs

Designing and implementing applications that can handle multiple recognition-based interaction technologies such as speech and gesture inputs is a difficult task. IMBuilder and MEngine are the two components of a new toolkit for rapidly creating and testing multimodal interface designs. First, an interaction model is specified in the form of a collection of finite state machines, using a simple...

Publication date: 2000